ABSTRACT
Investment in Africa over the past year with regards to SARS-CoV-2 genotyping has led to a massive increase in the number of sequences, exceeding 100,000 genomes generated to track the pandemic on the continent. Our results show an increase in the number of African countries able to sequence within their own borders, coupled with a decrease in sequencing turnaround time. Findings from this genomic surveillance underscores the heterogeneous nature of the pandemic but we observe repeated dissemination of SARS-CoV-2 variants within the continent. Sustained investment for genomic surveillance in Africa is needed as the virus continues to evolve, particularly in the low vaccination landscape. These investments are very crucial for preparedness and response for future pathogen outbreaks.
ABSTRACT
The Public Health Alliance for Genomic Epidemiology (PHA4GE) (https://pha4ge.org) is a global coalition that is actively working to establish consensus standards, document and share best practices, improve the availability of critical bioinformatic tools and resources, and advocate for greater openness, interoperability, accessibility and reproducibility in public health microbial bioinformatics. In the face of the current pandemic, PHA4GE has identified a clear and present need for a fit-for-purpose, open source SARS-CoV-2 contextual data standard. As such, we have developed an extension to the INSDC pathogen package, providing a SARS-CoV-2 contextual data specification based on harmonisable, publicly available, community standards. The specification is implementable via a collection template, as well as an array of protocols and tools to support the harmonisation and submission of sequence data and contextual information to public repositories. Well-structured, rich contextual data adds value, promotes reuse, and enables aggregation and integration of disparate data sets. Adoption of the proposed standard and practices will better enable interoperability between datasets and systems, improve the consistency and utility of generated data, and ultimately facilitate novel insights and discoveries in SARS-CoV-2 and COVID-19.